Tri-level handwritten text segmentation techniques for Gujarati language


Abstract

Objectives: To improve the efficiency of tri-level segmentation tasks for handwritten Gujarati text. Methods: Using hybrid segmentation methods, we segment lines, words and characters from a handwritten text image. This study presents a paradigm that works with touching characters, sloped lines written on the page, overlapping text, etc. It was evaluated on a dataset of 500+ images, created by us, of sentences written by different people. We use a horizontal projection technique, a scale-space technique, and vertical segmentation. Findings: The experimental results show that the proposed method is more efficient for text with diacritics. It obtained an accuracy of 82% at the character level, 90% at the word level, and 87% at the line level. Novelty: We designed a methodology to segment text with diacritics at all three levels, including words and lines. Applications: The method can serve as a pre-processing task in any recognition system, e.g. OCR. Keywords: Deep learning; tri-level segmentation
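The projection techniques named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a clean binary image (1 = ink, 0 = background), that horizontal projection is used to find text lines, and that a vertical projection within a line is used to find column-wise segments. The function names `projection_runs`, `segment_lines` and `segment_columns` are hypothetical. The scale-space step and the handling of sloped lines, touching characters and overlapping text described in the abstract are not covered here.

```python
from typing import List, Tuple

Image = List[List[int]]  # binary image: 1 = ink, 0 = background (assumption)


def projection_runs(profile: List[int], min_ink: int = 1) -> List[Tuple[int, int]]:
    """Return (start, end) pairs, end exclusive, where profile >= min_ink."""
    runs = []
    start = None
    for i, value in enumerate(profile):
        if value >= min_ink and start is None:
            start = i                      # a run of ink begins
        elif value < min_ink and start is not None:
            runs.append((start, i))        # a gap closes the current run
            start = None
    if start is not None:
        runs.append((start, len(profile)))
    return runs


def segment_lines(image: Image) -> List[Tuple[int, int]]:
    """Horizontal projection: count ink pixels per row, split at empty rows."""
    profile = [sum(row) for row in image]
    return projection_runs(profile)


def segment_columns(image: Image, top: int, bottom: int) -> List[Tuple[int, int]]:
    """Vertical projection inside rows [top, bottom): split at empty columns."""
    width = len(image[0]) if image else 0
    profile = [sum(image[r][c] for r in range(top, bottom)) for c in range(width)]
    return projection_runs(profile)


# Tiny synthetic page: two text lines separated by one blank row.
page = [
    [0, 0, 0, 0, 0, 0, 0],
    [1, 1, 0, 0, 1, 1, 0],
    [1, 0, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0, 0, 1],
    [0, 0, 0, 0, 0, 0, 0],
]

lines = segment_lines(page)                            # row ranges of text lines
first_line_columns = segment_columns(page, *lines[0])  # column ranges in line 1
print(lines, first_line_columns)
```

On real handwriting the gaps between lines are rarely fully empty, so a threshold above 1 for `min_ink`, or a smoothed profile, would likely be needed. Diacritics above and below the base line can also produce short runs that have to be merged with their neighbouring line.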


Similar resources

A perceptive method for handwritten text segmentation

This paper presents a new method to address the problem of segmenting handwritten text into text lines and words. We propose a method based on the cooperation among points of view that localizes the text lines in a low-resolution image and then associates the pixels at a higher level of resolution. Thanks to the combination of levels of vision, we can detect overlap...


Segmentation of Bangla Unconstrained Handwritten Text

To handle the variability in the writing styles of different individuals, in this paper we propose a robust scheme to segment unconstrained handwritten Bangla text into lines, words and characters. For line segmentation, we first divide the text into vertical stripes. The stripe width of a document is computed by statistical analysis of the text height in the document. Next we determi...


Research of Chinese Handwritten Text Segmentation Algorithm

OCR is a complicated process, and many factors can influence the recognition rate. Early on, researchers tried to optimize the classifier to obtain a high recognition rate, on the premise that the input contains only one character, whether printed or handwritten. Because classifier performance has improved considerably, the recognition rate for a single character is now high enough for commercial use. W...


Segmentation of Handwritten Gurmukhi Text into Lines

Text line segmentation is an essential pre-processing stage for handwriting recognition in many Optical Character Recognition (OCR) systems. It is an important step because inaccurately segmented text lines will cause errors in the recognition stage. Text line segmentation of the handwritten documents is still one of the most complicated problems in developing a reliable OCR. The nature of hand...


Clustering Algorithm for Gujarati Language

Natural language processing is still an active research area, and it now attracts researchers worldwide. It includes analyzing a language based on its structure and then tagging each word appropriately according to its grammar. Here we have a set of 50,000 tagged words, and we try to cluster those Gujarati words using the proposed algorithm; we have defined our o...



Journal

Journal title: Indian Journal of Science and Technology

Year: 2021

ISSN: 0974-5645, 0974-6846

DOI: https://doi.org/10.17485/ijst/v14i7.2146